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CODEBOOK SEARCH METHOD IN CELP VOCODER 
USING ALGEBRAIC CODEBOOK 

Field of the Invention 

5 

The present invention relates to a method for searching a 
codebook in a code excited linear prediction (CELP) vocoder 
using an algebraic codebook; and, more particularly, to a 
method for reducing codebook searching times when a depth 
10 first tree search method is used in algebraic code excited 
linear prediction (ACELP) vocoding using an algebraic codebook. 

Description of Related Art 

15 A technology for transmitting voice in digital has become 

widespread in a wired communication such as a telephone 
network, wireless communication and voice over Internet 
protocol (VoIP) network. It, in turn, has created interest in 
determining the least amount of information which can be sent 

20 over the channel while maintaining the perceived quality of 
the reconstructed speech. 

If voice is transmitted by simply sampling and digitizing, 
a data rate of 64 kilobits per second (kbps) is required. 
However, the data rate for transmitting can be reduced by 

25 using voice analysis and appropriate coding method. 

A vocoder is a device for compressing voice by extracting 
parameters that relate to a model of human voice. The vocoder 
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includes an encoder and a decoder • The encoder analyzes the 
incoming voice so as to extract the relevant parameters. The 
decoder re-synthesizes the voice using the parameters received 
over a channel, such as a transmission channel. 
5 A linear-prediction-based time domain vocoder is the most 

popular type of the vocoder. The linear-prediction-based 
technique extracts the correlation between the input voice 
samples and past samples, and encodes only the uncorrelated 
part. The function of the vocoder is to compress the 

10 digitized voice signal into a low bit rate signal by removing^ 
all of the natural redundancies inherent in the voice. The 
voice typically has short term redundancies due primarily to 
the filtering operation of the lips and tongue, and long term 
redundancies due to the vibration of the vocal cords. In a 

15 code excited linear prediction (CELP) coder, two filters, a 
linear predictive coding (LPC) filter and a pitch filter are 
used for modeling the voice. The LPC filter receives noise- 
like signal and is excite by a voiceless sound. Also, the LPC 
filter receives a quasi periodic input and is excited by a 

20 nasal sound and a vowel. Once these redundancies are removed, 
the resulting residual signal is modeled as white gaussian. 
noise or multi-pulse according to a kind of CELP coding and 
encoded . 

The CELP algorithm has been introduced for effective 
25 coding. The CELP vocoding at a rate of 4 to 8 kbps guarantees 
almost same quality of vocoding using other vocoders at 32 
kbps. The CELP vocoder has two advantages. First, the CELP 
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vocoder detects more detailed voice signals by extracting 
pitch information using a pitch predictor. Second, the CELP 
vocoder excites the LPC filter by using noise-like signals 
generated from residual signals generated from actual voice 
5 signals. 

The CELP algorithm has been broadly used for voice 
compression at a low bit rate while guaranteeing good quality. 
The CELP algorithm is applied to fields of cellular 
communications, satellite communications and digital voice 
10 storages. 

A stochastic codebook has been applied to the early CELP 
algorithm as a codebook. The stochastic codebook includes N 
number of sample codes. However, it takes long time to search 
the codebook because an analytic synthesis method by the CELP 

15 algorithm is used. Lately, searching time has been reduced by 
using a stochastic codebook based upon a linear combination of 
a small number of basic vectors. However, it still takes long 
time to search a codebook and large storage unit is required. 

For overcoming above mentioned problem, an algebraic 

20 codebook has been introduced. An algebraic CELP (ACELP) 
algorithm is a CELP algorithm using the algebraic codebook and- 
has been selected to many speech coding standards, e.g., 
global system for mobile communication-enhanced full rate 
(GSM-EFR), enhanced variable o rate coder (EVRC) and adaptive 

25 multi-rate (AMR) . The ACELP algorithm does not need a large 
storage unit for the codebook because the codebook is not 
required. Because of its effective searching method, the 
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ACELP algorithm needs less computation amount in searching the 
codebook comparing to the CELP algorithm. 

A limit of error to a target signal is minimized for 
searching a location and a magnitude of a pulse of an excited 
5 signal in the ACELP algorithm. It results large computation 
amount. Therefore, a focused search method and a depth first 
tree search method are used in the ACELP algorithm so as to 
reduce the computation amount . 

The focused search method in G.729 codec limits a 

10 searching range by using a thresh-hold value. The depth first 
tree search method in G.729A searches only branches that 
satisfy a local maximum. 

Fig.l is a block diagram showing encoding procedures of 
an ACELP vocoder using a typical algebraic codebook. 

15 As shown, a typical ACELP vocoder uses 20 millisecond 

(ms) speech frames for coding and decoding. In each 2 0ms 
interval, the encoder processes 160 samples of speech. The 
typical ACELP vocoder extracts pomant information, pitch 
information and codebook information that shows 

20 characteristics of voice signal. At step 10, DC components of 
input voice signals are removed by a high pass filter and a^ 
10^** order coefficients of linear predictive coding (LPC) is 
computed by using a 30 millisecond (msec) asynchronous window 
and a Levinson-Durbin algorithm. At step 11, the LPC 

25 coefficients are transformed into line spectral pair (LSP) 
coefficients that have good linear interpolation 
characteristics, small quantization distortions and small 
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transmitting errors. At step 12, the LSP coefficients are 
quantized. 

The LPC parameters are interpolated into adequate LPC 
parameters for pitch searching and codebook searching. 
5 The pitch searching is divided into a step of open-loop 

searching and a step of closed-loop searching. At step 13, a 
value of pitch delay is determined by the open-loop searching. 
At step 15, an impulse response is computed. At step 16, a 
target signal x(n) is computed and zero input responses from 
10 input voice signals is removed. At step 14, an exact value of 
pitch delay is determined by the closed-loop searching. The 
value of pitch delay has the least mean square error to the 
target signal. 

At step 17, a target signal X2(n) for algebraic codebook 
15 searching and the pitch signal is removed from the target 
signal x(n) . At step 18, a location and a sign of the pulse 
is determined while the input voice signal has the least mean 
square error to the target signal X2(n). Sub-frames of the 
algebraic codebook include a plurality of tracks. A 
20 predetermined number of pulses are allocated to each track to 
model excited signals of the sub-frame effectively. Also, 
magnitudes of pulses are fixed to ±1 to reduce computation. 
Finally, algebraic codebook information includes a location 
and a sign of pulses allocated in each track. 
25 The mean square error between the input voice signal and 

the synthesized voice signal is expressed as following Eq. 1. 
Algebraic codebook searching in the ACELP algorithm is a 
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process of finding pulses of the excited signals by minimizing 
a value obtained by Eq, 1. 



e,=\\X-gHc, 



[Eq. 1] 
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Referring to Eq. 1, ^ is a target signal from which a 
predicted gain of an adaptive codebook is removed and g is a 
codebook gain. H is expressed as h^h and is a lower 
triangular toepliz convolution matrix that is generated from 
the impulse function of weighted synthesis filter. Ck is an 
algebraic code vector. 
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[Eq.2] 



is an impulse response and a magnitude of a sub- 
15 frame, n is 40. Eq. 1 can be described as following Eq. 3. 



c'.H'Hc, 



[Eq.3] 



An optimal code vector can be determined from Eq. 3 by 
20 maximizing a result of following Eq. 4. 
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[Eq.4] 



is a signal that shows correlation between the target 
signal x(n) and the impulse response h(n). d is called a 
reverse filtered target signal and is expressed as: d-H'x. x 
is a target signal from which a predicted gain of an adaptive 
codebook is removed. <l)is a correlation matrix of h(n) and is 
expressed as: 0 = 7/'//. 

A numerator of Eq. 4 can be described as below Eq. 5 
because an algebraic code vector includes small number of 
pulses that are non-zero. 



m. is an i^ location of a pulse, s. is a sign of a pulse 
and is the number of pulses. 

A denominator of Eq. 5 can be described as below Eq. 6. 




[Eq.5] 




[Eq.6] 



d{ri) and are computed in advance in Eq. 6 to reduce 

computation amount. irij is j^^ location of a pulse. The 
focused search method and the depth first tree search method 



are used in the ACELP algorithm so as to reduce computation. 

A thresh-hold value is computed in advance to simplify 
the search process in the focused search method. However, if 
the number of pulses is increased, the implementation of the 
5 focused search method becomes difficult. 

The depth first tree search method is modified method of 
the focused search method and searches branches that satisfy a 
local maximum. 

The depth first tree search method is applied to the GSM- 
10 EFR codec. When 10 pulses are chosen from 40 pulses in the 
GSM-EFR codec, a combination is 4oCio=847*10^ times. However, 
when the depth first tree search method is applied in the GSM- 
EFR codec, the number of search is 4* ( 4* ( 8*8 ) )=1024 times. 

However, a predetermined number of pulses are allocated 
15 to each track to model excited signals of the sub-frame 
effectively in the algebraic codebook. Also, magnitudes of 
pulses are fixed to ±1 to reduce computation. 40 sub-frames 
are divided into 5 tracks and each track uses two pulses in 
the GSM-EFR codec. 

20 



[Table 1] 



Track 


Pulse 


Position 


1 


i0,i5 


0,5, 10, 15,20,25, 30, 35 


2 


il,i6 


1,6, 11, 16,21,26, 31, 36 


3 


12 , 17 


2,7,12,17,22,27,32,37 


4 


13,18 


3,8,13,18,23,28,33,38 


5 


14,19 


4,9,14,19,24,29,34,39 



Although the number of search in the GSM-EFR codec is 
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reduced to 1024 times by using the depth first tree search 
method, the computation amount for searching is still large 
and takes 4 0% of total computation amount. 

5 Summary of the Invention 

It is, therefore, an object of the present invention to 
provide a method for searching algebraic codebook having small 
computation amount by limiting the number of searching trees 

10 in an algebraic codebook in algebraic code excited linear' 
prediction (ACELP) vocoder using depth first tree method. 

In accordance with an aspect of the present invention, 
there is provided a method for searching an algebraic codebook 
in ACELP vocoding using a depth first tree method, including 

15 the steps of: a) searching at a predetermined level to predict 
a tree in which optimum pulse is located; b) choosing a 
predetermined number of trees according to the search result 
of the step a) and remove a residual trees; c) searching the 
chosen trees and choosing optimum algebraic code. 

20 

Brief Description of Drawings 

The above and other objects and features of the present 
invention will become apparent from the following description 
25 of the preferred embodiments given in conjunction with the 
accompanying drawings, in which: 

Fig.l is a block diagram showing encoding procedures of 
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an ACELP vocoder using a typical algebraic codebook; 

Fig. 2 is a flowchart showing a method for searching 
algebraic code in an algebraic, codebook in accordance with the 
present invention ; 
5 Fig. 3 is an exemplary diagram showing a tree having 

levels for searching an algebraic codebook in accordance with 
the present invention; 

Fig. 4 is an exemplary diagram showing maximum values in 
each track and a maximum value in total tracks in accordance 
10 with the present invention; 

Fig. 5 is an exemplary diagram showing fixation of pulses 
and searching of pulses in an algebraic codebook in accordance 
with the present invention; and 

Fig. 6 is an exemplary diagram showing search results of 
15 10 total pulses in accordance with the present invention. 

Detailed Description of the Invention 

Other objects and aspects of the invention will become 
20 apparent from the following description of the embodiments 
with reference to the accompanying drawings, which is set 
forth hereinafter. 

Fig. 2 is a flowchart showing a method for searching 
algebraic code in an algebraic codebook in accordance with the 
25 present invention. 

Referring to Fig. 2, at step 100, a tree is searched to a 
certain level by using the depth first tree search method to 
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predict an optimum location of a pulse. At step 200, adequate 
branches are chosen and residual branches are removed 
according to the search results of the step 100. At step 300, 
an optimum algebraic code is chosen. 
5 Fig. 3 is an exemplary diagram showing a tree having 

levels for searching an algebraic codebook in accordance with 
the present invention. 

Fig. 4 is an exemplary diagram showing maximum values in 
each track and a maximum value in total tracks in accordance 
10 with the present invention. 

Fig. 5 is an exemplary diagram showing fixation of pulses 
and searching of pulses in an algebraic codebook in accordance 
with the present invention. 

Fig. 6 is an exemplary diagram showing search results of 
15 10 total pulses in accordance with the present invention. 

First, b(n) is a sum of normalized backward filtered 
target signals and normalized long-term prediction residual 
signals. Maximum values of b(n) in each tracks are determined 
and stored in pos-max[ ] as shown in Fig. 4. 
20 Second, a global maximum, 31 in Fig. 4, is stored in 

ipos[0] and a location of the global maximum is stored in pos- 
max[ ipos [ ] ] . 

Third, first pulse, an iO is fixed as shown in 40 in Fig. 
5 and a second pulse, il, is fixed in a location of a maximum 
25 value in the next track as shown in 41 in Fig. 5. 

Forth, a maximum value is determined by searching two 
tracks, T3 and T4, for 8*8 times as shown in 42 and 43 in Fig. 
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5. 

Fifth, a pulse pair, i2 and i3, is chosen by rotating 
starting point of il. 

For example, if il is located in local maximum of T3, T2 
5 and T3 are searched for locations of i2 and i3. il 
subsequently changes a location from 32 to 33, 34 and 30 as 
shown in Fig. 4. Therefore, the number of search is 
4x (8x8)=256. 

Sixth, two large values, 22 and 23 in Fig. 3, are chosen 
10 by computation using Eq. 4 and residual branches that are not 
likely to be chosen are removed. 

Seventh, i4 and i5, i6 and i7, 18 and 19, are searched 
and determined according to the two chosen branches as shown 
in Fig. 6. The number of searching is 2x (3^ (8x8) ) =384 . 
15 Two branches are chosen at level 1 and residual branches 

are removed. The number of searching is total 640 times that 
sums 256 times at fifth step and 384 times at seventh step. 

However, 1024 times of searching are necessary in the 
prior method. Therefore, the present invention reduces 40% of 
20 computation amount . 

When the number of searching is generalized, the number 
of trees that are chosen is T and the level at which branches 
are chosen is L. Total searching is 4xLx (8x8) +Tx (4-L) x (8x8)- 
times that sums 4xLx(8x8) times and Tx (4-L) x ( 8x8 ) times. 
25 The computation result of searching is shown in Table 2. 
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[Table 2] 



Tree 


Level 0 


Level 1 


Level 2 


Level 3 


Level 4 


1 


256(25.0%) 


448(43.8%) 


640(62 .5%) 


832(81.3%) 


1024 ( 100%) 


2 


512(50.0%) 


640(62 .5%) 


768(75.0%) 


896(87.5%) 


1024(100%) 


3 


786(75.0%) 


832(81.3%) 


896(87.5%) 


960(93.8%) 


1024( 100%) 



For example, when two trees are chosen at level 2 to 
raise provability, total number of searching is 768 times and 
5 25% of computation is reduced. 

Also, when two trees are chosen at level 1, total number 
of searching is 64 0 times and 25% of computation amount is 
reduced . 

As mentioned above, the present invention can reduce 
10 complexity of computation as about 4 0% comparing to the^ 
conventional depth first tree search method. As the 
computation amount is reduced, a low price digital signal 
processing (DSP) chip is available to implement the ACELP 
algorithm and low power is consumed for the computation. 
15 Therefore, the method in accordance with the present invention 
provides compatibility for a potable vocoder by allowing more 
time to use the potable vocoder because the computation amount 
directly affects power consumption of the vocoder. 

While the present invention has been described with 
20 respect to certain preferred embodiment, it will be apparent 
to those skilled in the art that various changes and 
modifications may be made without departing from the scope of 
the invention as defined in the following claims. 
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